1 Annals of 

human genetics 



doi: 10.1111/ahg. 12033 

C90RF72 Intermediate Repeat Copies Are a Significant Risk 
Factor for Parkinson Disease 

Karen Nuytemans 1 ' 1 , Guney Bademci 1,2 ' 1 , Martin M. Kohli 1 , Gary W. Beecham 13 , Liyong Wang 1,3 , 
Juan I. Young 1 ' 3 , Fatta Nahab 4 , Eden R. Martin 13 , John R. Gilbert 13 , Michael Benatar 4 , 
Jonathan L. Haines 5 , William K. Scott 13 , Stephan Zuchner 13 , Margaret A. Pericak-Vance 13 and 
Jeffery M. Vance 1 ' 3 * 

1 University of Miami, Miller School of Medicine, John P. Hussman Institute for Human Genomics, Biomedical Research building, 1501 
NW 10 th Ave, Miami, FL 33136, USA 

2 Kanuni Training and Research Hospital, Clinical Genetics Lab, Kasustu, Trabzon 61290, Turkey 

3 University of Miami, Miller School of Medicine, Dr. John T. Macdonald Foundation Department of Human Genetics, Miami, FL 
33136, USA 

4 University of Miami, Miller School of Medicine, Department of Neurology , Clinical Research Building, 1120 NW 14 Street, Miami, 
FL 33136, USA 

5 Center for Human Genetics Research, Vanderbilt University Medical Center, 519 Light Hall, 2215 Garland Ave, South Nashville, TN 
31232, USA 



Summary 

We set out to determine whether expansions in the C90RF72 repeat found in amyotrophic lateral sclerosis (ALS) and 
frontotemporal dementia (FTD) families are associated with Parkinson disease (PD). We determined the repeat size in a 
total of 889 clinically ascertained patients (including PD and essential tremor plus Parkinsonism (ETP)) and 1144 controls 
using a repeat-primed PCR assay. We found that large C90RF72 repeat expansions (>30 repeats) were not contributing 
to PD risk. However, PD and ETP cases had a significant increase in intermediate (>20 to 30+) repeat copies compared 
to controls. Overall, 14 cases (13 PD, 1 ETP) and three controls had >20 repeat copies (Fisher's exact test p = 0.002). 
Further, seven cases and no controls had >23 repeat copies (p = 0.003). Our results suggest that intermediate copy 
numbers of the C90RF72 repeat contribute to risk for PD and ETP. This also suggests that PD, ALS and FTD share 
some pathophysiological mechanisms of disease. Further studies are needed to elucidate the contribution of the C90RF72 
repeat in the overall PD population and to determine whether other common genetic risk factors exist between these 
neurodegenerative disorders. 
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Introduction 

Parkinson disease (PD) is a neurodegenerative movement dis- 
order characterized by bradykinesia, rigidity, postural instabil- 
ity and tremor. PD prevalence increases with age, with esti- 
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mates of 4— 5% of the population being affected at 85 years and 
older (Fahn, 2003; Tansey et al., 2007). While the diagnosis of 
"idiopathic" PD is well established, there are many disorders 
that contain the essential elements of the PD phenotype, often 
termed Parkinsonism, such as progressive supranuclear palsy 
(PSP) and multiple system atrophy (MSA). Wider overlap also 
occurs, including frontotemporal dementia with Parkinson- 
ism (FTDP-17) and Lewy Body Dementia (Wszolek et al., 
2006). However, minimal clinical overlap has been reported 
between PD and amyotrophic lateral sclerosis (ALS), a neu- 
rodegenerative disorder characterized by progressive muscular 
paralysis reflecting degeneration of the motor neuron in the 
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primary motor cortex, brainstem and spinal cord (Wijesekera 
and Leigh, 2009). Previous observations have linked Parkin- 
sonism with motor neuron disease (Brait— Fahn disease) or 
the Guam PD-ALS-dementia complex. This raised the hy- 
pothesis that ALS and PD might be related, as members of a 
spectrum of phenotypes with shared pathophysiological ele- 
ments (Brait et al, 1973; Uitti et al, 1995; Williams et al, 
1995; Qureshi et al, 1996; Pinkhardt et al., 2009). However, 
consumption of the unusual amino acid BAA has subsequently 
been thought to be the cause of Guam PD-ALS-dementia, 
and the incidence of the disorder in Guam has dropped since 
dietary modifications (Khabazian et al., 2002; Pablo et al., 
2009). Nonetheless, recently, a study has shown that rare ge- 
netic variants in the Angiogenin gene (ANG) contribute to 
the occurrence of both PD and ALS (van Es et al., 2011). 
Mutations in the Valosin Containing Protein gene ( VCP) are 
linked with ALS and also have been postulated to be a risk 
factor for PD, though data reported to date are inconsistent 
(Johnson et al., 2010; Chan et al., 2012; Majounie et al., 
2012c). This raised the hypothesis that these two disorders 
share other common genes or pathways that contribute to the 
development of neurodegeneration. Therefore, when several 
reports recently demonstrated that large expansions of a hex- 
anucleotide repeat in C90RF72 were correlated with the de- 
velopment of ALS (Renton et al., 2011; Dejesus-Hernandez 
et al., 2011; Gijselinck et al., 2012), we set out to examine 
the involvement of this complex repeat in PD. Interestingly, 
several subsequent publications (Boeve et al., 2012; Cooper- 
Knock et al., 2012; Mahoney et al., 2012; Simon-Sanchez 
et al., 2012) report an unexpected high presence of Parkin- 
sonism and PD in ALS and FTD families of C90RF72 repeat 
expansion carriers. Here, we demonstrate that intermediate 
C90RF72 repeat copy numbers (>20— 30+ repeat copies) 
are observed at significantly higher levels in PD patients than 
controls. 

Materials and Methods 

Samples 

Initially, we evaluated 396 unrelated cases with PD (non- 
Hispanic /Latino Caucasians, range of age-at-onset (AAO): 
10—85 year, average AAO: 53.6 year) and 12 cases of essential 
tremor with Parkinsonism (ETP) (Rocca et al., 1998). Patients 
were collected by 1 of 13 ascertainment centres in the PD 
Genetics Collaboration (Scott et al., 2001) or by the Morris 
K. Udall Parkinson Disease Center of Excellence (PI: Vance 
JM) ascertainment core. These participants were recruited 
primarily by participating movement disorder and neurology 
clinics. All individuals with PD and ET were examined by 
a board-certified neurologist. The diagnostic criteria used in 
evaluating ET patients are based on the Movement Disorders 
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Society Consensus Criteria (Deuschl et al., 1998). Individu- 
als are considered possibly affected with ET (types 1 and 2) 
when meeting criteria defined by the Tremor Investigation 
Group (Deuschl et al., 1995). Families with positive family 
history for FTD were excluded from the analysis. All patients 
were screened at ascertainment for cognitive status using the 
3MS or short blessed test (Teng & Chui 1987; Katzman et al., 
1983). Any cases with an onset of dementia within one year 
of the diagnosis of PD were excluded from the analysis. As 
positive controls for the hexanucleotide repeat, we included 
three ALS patients (M. Benatar, personal communication) 
with >50 C90RF72 repeats (Renton et al, 2011). For the 
replication dataset, we used 481 patient DNA samples from 
the NINDS Human Genetics Resource Center DNA and 
Cell Line Repository (http://ccr.coriell.org/ninds). Clinical 
data for these patients are available online. The submitters 
that contributed samples are acknowledged in detailed de- 
scriptions of each panel: NDPT083-085-087-088-089-102- 
104-105. Of the 481 samples reported here, 169 samples were 
reported before by Majounie et al. (2012a). We selected sam- 
ples for whom clinical data were provided by a neurologist 
(Caucasians, range of AAO: 7—87 year, average AAO: 55.3 
year) to minimize the difference in inclusion criteria between 
the two patient groups in the initial and replication datasets. 
Both patient groups have similar percentages of early onset 
age (<50 year; 39.5% versus 35%) and positive family history 
(36% versus 38%). 

We evaluated 427 unaffected controls in the initial 
dataset (non-Hispanic/Latino Caucasians, range of age-at- 
exam (AAE): 60—101 year, average AAE: 77.2 year). In the 
replication analyses, an additional 717 controls (Caucasians, 
range of AAE: 60—100 year, average AAE: 71.7 year) were 
analysed. To lessen the chance of presymptomatic PD pa- 
tients, we utilized controls older than 60 year that were col- 
lected for an ongoing study in Alzheimer disease (MPV PI). 
All controls scored within normal limits on the Modified 
Mini Mental State test and had normal neurological exams at 
age-of-exam. PD symptomology is an exclusion criterion for 
the controls in the Alzheimer disease study. The patients and 
controls have been included in GWAS before (Edwards et al., 
2010) with no significant population stratification differences. 



Genotyping: Repeat-Primed PCR and 
Haplotype SNP Genotyping 

The repeat-primed PCR assay used to screen for the 
(GGGGCC),, repeat expansion limits the maximum num- 
ber of repeats that can be detected to 60. The PCR cycling 
program of Renton et al. ( 2011) was modified to achieve 
more robust results on a Veriti 96-well Fast Thermal Cy- 
cler (Applied Biosystems, Life Technologies, Carlsbad, CA, 
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USA). A custom PCR cycling program was used (4 min at 
94° C; 50 cycles of 1 min at 94° C, 1 mm at 64° C and 2 min 
at 72°C; 10 min at 72°C). Genotypes for the C90RF72 
risk haplotype SNPs (Mok et al., 2012a) were available for 
this analysis from a previous GWAS study (Edwards et al., 
2010) for 10 out of 14 intermediate repeat copy carriers. 
For the remaining four, the haplotype tag rs3849942 was 
genotyped by Sanger sequencing. Primer sequences flanking 
rs3849942 are F; 5'-GCAATTTCCTTGATTTGCAG and 
R; 5'-GGGGGAAAAGAGAAATGGAT. Fragment length 
analysis and Sanger sequencing were performed on an ABI 
3730x1 genetic analyser (Applied Biosystems, Life Tech- 
nologies), and fragment analysis data were analysed using 
GeneMapper software (version 4.0, Applied Biosystems, Life 
Technologies) . 

Southern Blot 

The 5 fig of DNA was digested with EcoRI and Sacl. 
Digested samples were separated on a discontinuous 0.9% 
(4 cm)/1.5% (10 cm) agarose TAE gel, followed by partial 
depurination with HC1 (0.4N) for 15 min and denatura- 
tion in NaOH (0.5N) for 30 min. DNA was transferred in 
20 x standard saline citrate (SSC) to a charged nylon mem- 
brane (Roche Applied Science, Indianapolis, USA) by capil- 
larity. The 1 kb and 100 bp DNA ladders were used as size 
standards. The membranes were cross-linked and hybridized 
overnight at 47° C in roller bottles in Dig Easy Hybridization 
Buffer (Roche Applied Science) with a 241 bp probe synthe- 
sized by PCR from genomic DNA, using primers flanking 
exon 1 of C90RF72 (5'-AGAACAGGACAAGTTGCC- 
3' and 5'-AACACACACCTCCTAAACC-3') labelled with 
Dig-1 1-dUTP by PCR (PCR Dig Synthesis Kit; Roche Ap- 
plied Science). Filters were washed twice for 5 min in 2x 
SSC/1% SDS at room temperature and twice for 15 min in 
1 x SSC/0.1% SDS at 65°C. Filter blocking and signal detec- 
tion were performed by incubating in blocking solution (DIG 
Nucleic Acid Detection Kit; Roche Applied Science) for 30 
min, with the anti-DIG-AP antibody (1:5000 in blocking so- 
lution) for 30 min, and ultimately in the solution containing 
the NBT/BCIP substrate until the colour could be visualized. 
Images were captured with a Sony DSC-W350 digital camera 
and processed by Adobe Photoshop. 

Statistical Methods 

To test whether higher repeat copy numbers increase one's 
risk for PD, we conducted Fisher's exact tests using the a pri- 
ori thresholds of greater than 20 or 23 repeat copies (RCs). 
The common maximum RCs reported in controls in this 
study and others is 23 repeats, while by far the majority 



(~99.4%) of all reported controls have RCs lower than 20 
(Dejesus-Hernandez et al., 2011; Renton et al., 2011; Byrne 
et al, 2012; Cooper-Knock et al, 2012; Chio et al, 2012; 
Daoud et al., 2012; Dejesus-Hernandez et al., 2012; Dobson- 
Stone et al., 2012; Ferrari et al., 2012; Garcia-Redondo et al., 
2012; Gijselinck et al, 2012; Konno et al, 2012; Majounie 
et al, 2012b; Millecamps et al., 2012; Mok et al, 2012b; 
Ogaki et al., 2012; Ratti et al, 2012; Rollinson et al, 2012; 
Rutherford et al., 2012; Sabatelli et al., 2012; Simon-Sanchez 
et al., 2012; van Rheenen et al., 2012; Xi et al, 2012; Yeh 
et al., 2012). Odds ratios represent the effect of interme- 
diate repeat copies relative to lower repeat copies (<20 or 
<23 RCs depending on analysis) and are calculated using a 
2x2 contingency table. To test for overall distributional and 
mean differences, we used the Kolmogorov— Smirnov test and 
a standard <-test, respectively. Linear regression was used to 
test correlation between number of RCs and age-at-onset, 
age-at-exam or family history. To determine whether con- 
founding influenced our pooling analysis, we applied the 
Cochran— Mantel— Haenszel test which performs pooled as- 
sociation analyses while controlling for datasets. All analyses 
were conducted using R software version 2.13.0. p-Values 
of 0.05 or below were considered statistically significant evi- 
dence of association. 



Results 

Scoring of the PCR electropherogram found inconsistencies 
between different researchers when attempting to call one or 
two peaks. Thus, all samples with one or two peaks were 
merged into a single group. To determine the correlation of 
the number of peaks in the repeat-primed PCR assay with 
actual repeat copy number (RC), we sequenced the repeat re- 
gion in 34 samples with variable peak numbers, using the un- 
labeled genotyping primers reported in Dejesus-Hernandez 
et al. (2011). This demonstrated that the number of the actual 
RCs in our study equals the number of peaks plus 2. 

The positive controls (ALS patients) had a maximum num- 
ber of repeats greater than 50. The average maximum number 
of RCs in the initial clinical dataset was 6.9 in the controls 
and 8.2 in the cases (ranging <4 to >30) (Fig. 1). There was 
no significant contribution of repeats >30 in the PD cases. 
However, the PD cases had more intermediate size repeats 
(>20 repeats to 30+) versus controls. As the majority of con- 
trols in the literature are reported to have less than 20 repeats, 
we used this value as a first a priori threshold for normal versus 
intermediate RCs (Dejesus-Hernandez et al., 2011; Renton 
et al, 2011; Millecamps et al, 2012; Sabatelli et al., 2012). 
We found eight PD patients, one ETP patient and one con- 
trol with intermediate RCs (Table 1) in our initial dataset. 
The one-tailed Fisher's exact test in the initial dataset showed 
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Figure 1 Histogram of original dataset. 

Histogram of the maximum number of C90RF72 repeat copies (X axis) for 407 PD and 
ET-with Parkinsonism patients (black) and 427 controls (white). 

Table 1 Clinical information of the patients in the initial dataset with 20 or more repeat copies. 
# of 



Patient 


repeat 
copies 


Gender 


AAO 


AAE 


Family 
history 


Phenotype 


Tremor 


Bradykinesia 


Rigidity 


Gait 

disturbance 


Dementia 


PI 


30 


F 


2() 


97 


Yes 


Essential tremor with 


Kinetic 


Yes-(Mild) 


Yes-(Mild) 


No 


No 














parkinsonism 












P2 


28 


M 


31 


34 


No 


Parkinson disease 


Resting 


Yes 


Yes 


No 


Not assessed 


P3 


25 


F 


33 


34 


Yes 


Parkinsonism with 


Resting 


No 


No 


Yes 


No 














predominant tremor 












P4 


24 


F 


56 


71 


No 


Parkinson disease 


Resting 


Yes 


Yes 


Yes 


No 


PS 


23 


M 


46 


51 


Yes 


Parkinson disease 


Resting 


Yes 


Yes 


Yes 


Yes 


P6 


22 


M 


61 


68 


Yes 


Parkinson disease 


Resting 


Yes 


No 


Yes 


No 


P7 


21 


F 


44 


56 


Yes 


Parkinson disease 


Resting 


No 


Yes 


Yes 


No 


P8 


21 


M 


64 


66 


Yes 


Parkinson disease 


Resting 


Yes 


No 


Yes 


Not assessed 


P9 


21 


M 


53 


53 


No 


Parkinson disease 


Resting 


Yes 


Yes 


Yes 


No 


AAO; as 


je at onset, AAE; 


age-at- 


exam, 


Dementia 


dementia at time of ascertainment. 











significant association of the intermediate RCs with an in- 
creased risk for PD (p = 0.008; two-tailed p = 0.010). The 
odds ratio for the intermediate RCs in this original dataset is 
9.6 [95% CI: 1.32 - 421] (two-tailed). The confidence inter- 
val is very wide, likely due to the rarity of the intermediate 
repeat. Even when using the second, more stringent a priori 
threshold of 23 RCs — the common maximum number of 
repeats reported in controls — marginal significance was ob- 
tained (ratio patients: controls 4:0; one/two-tailed p — 0.05). 
The intermediate RC results in the clinical NINDS repli- 



cation dataset are shown in Figure 2 and Table 2. Patient 
ND09224 was reported before by Majounie et al. (2012a); 
differences in interpretation of the peak pattern might account 
for the variance in reported repeat copies. Again, the num- 
ber of PD cases with intermediate repeats was twice that of 
controls, although the overall number of individuals with in- 
termediate repeats was not large enough to reach significance 
(>20 copies; ratio patientsxontrols 5:2, one-tailed p — 0.09, 
two-tailed p — 0.12; >23 copies; ratio patientsxontrols 3:0, 
one/two-tailed p — 0.06). However, importantly, the pooled 
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Figure 2 Histogram of replication dataset. 

Histogram of the maximum number of C90RF72 repeat copies (X-axis) for 481 PD cases 
(black) and 726 controls (white). 



Table 2 Clinical information of the NINDS patients with 20 or more repeat copies. 



# of 



Patient 


repeat 
copies 


Gender 


AAO 


AAE 


Family 
history 


Phenotype 


Tremor 


Bradykinesia 


Rigidity 


Gait 

disturbance 


Dementia 


ND00236 


>30 


M 


40 


60 


yes 


Parkinson disease 


No 


Yes 


Yes 


Yes 


NA 


ND09224 


26 


F 


68 


77 


no 


Parkinson disease 


Resting 


Yes 


Yes 


Yes 


No 


ND02556 


24 


M 


65 


77 


no 


Parkinson disease 


Resting 


Yes 


Yes 


Yes 


NA 


ND03101 


22 


F 


61 


74 


no 


Parkinson disease 


Resting 


Yes 


Yes 


Yes 


NA 


ND00277 


21 


M 


40 


65 


no 


Parkinson disease 


Resting 


Yes 


Yes 


Yes 


NA 



AAO; age at onset, AAE; age-at-exam, Dementia; dementia at time of ascertainment, Patient ND09224 was reported before by Majounie 
et al (2012a). 



analysis of both clinical datasets (patients N — 889/controls 
N — 1144) (Fig. 3) had more power and obtained a lower 
p-value than seen in either dataset: >20RCs; patientsxontrols 
14:3 (risk OR 6.1 [95% CI: 1.68-33.1] (two-tailed); one- 
tailed p = 0.001, two-tailed p = 0.002), >23RCs; patients 
7: controls 0 (one/two-tailed p — 0.003), indicating that the 
replication dataset strengthens the signal from the original 
dataset. 

Overall 2% of the 889 clinical PD cases had RCs > 20. 
As the frequency of >20RCs carriers in our control groups 
(~0.3%) is lower than the overall frequency in the literature 
(~0.6%), we performed the same association analysis in the 
total dataset adjusting for a higher frequency of >20RCs as 
reported in controls. We still obtained evidence for significant 



association (>20 copies; ratio patientsxontrols 14:'6', p — 
0.015). 

The 20 SNPs reported by Mok et al. (2012a) were 
genotyped in a previous GWAS for 10 out of 14 clinical in- 
termediate repeat copy carriers (Edwards et al., 2010). All 10 
intermediate copy carriers had the alleles of the risk haplotype 
(N— 20), with the exception of the most extreme ends of the 
haplotype in three individuals (Table SI). All intermediate 
copy carriers carried the T allele of SNP rs3849942, which 
"tags" the haplotype as the least frequent allele on the haplo- 
type. Next, we examined this tagSNP in our full PD dataset. 
We found that 95% of the clinical cases and controls with 
greater than 8 repeats carried the tagging T allele versus 10% 
of those individuals with <8 repeats (Fig. 4). This led us to 
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Figure 3 Histogram of combined dataset. 

Histogram of the maximum number of C90RF72 repeat copies (X-axis) for 889 PD 
cases (black) and 1144 controls (white) . 
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Figure 4 Percentages of rs3849942 genotypes corresponding to C90RF72 repeat copy 
carriers. 



test whether there was an overall significant shift in the distri- 
bution of repeat size between cases and controls. In the initial 
clinical dataset, the distribution of the RCs was significantly 
different between cases and controls (KS test p-value = 5.28e- 
07, f-test for means of the distribution p-value = 6.68e-07). 
However, the NINDS dataset did not have a significant 
distribution shift of RCs in PD versus controls or a mean 
difference (KS test^-value = 0.97, <-test p- value = 0.98). A 
comparison of the two control groups showed that there were 
no differences in either the overall distribution or the repeat 
count mean that would account for this difference. While the 
NINDS set by itself was not significant, a combined analysis 
of both datasets showed a significant difference between 
overall distribution and distribution means of RCs between 
cases and controls (KS test p-value = 0.01 /f-test p-value = 
0.01), |>-values being notably less significant than in the orig- 
inal analysis. As the histograms of both case groups showed 



differences in distribution (Figs 1 and 2), we tested whether 
the results of the pooled analysis were due to confounding by 
the datasets. To control for having two different datasets, we 
performed the Cochran— Mantel— Haenszel test (CMH). The 
CMH test is analogous to a / 2 test, but can be used to control 
for differing datasets. The CMH results still showed signifi- 
cant results (p — 0.0064/p = 0.0074 for the two thresholds), 
indicating that results were not due to confounding. 

No evidence was seen for correlation of RCs with AAO 
(p = 0.792) for the clinical cases, or with AAE for the two clin- 
ical case and two control groups (p = 0.8246/p = 0.1624/p = 
0.2513/p = 0.8284, respectively). No correlation of repeat 
length with dementia was observed in the patients of the 
initial dataset, for which data were available (N — 230) (p = 
0.267). RC distributions of PD patients with early (<40 year) 
versus late AAO and positive versus negative family history 
were not significantly different. 
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Figure 5 Segregation analysis in Family 1. 

(A) Pedigree. Symbols: Top right: ET type I; bottom right: possible ET type II; top left: ET by history; fully blackened: 
definite ET. (B) Electropherogram results of repeat-primed PCR of PI, aS and uS. 



Genotype of >20 Repeat Individuals Using 
Southern Blot 

We performed Southern blot analysis to determine whether 
the clinically affected individuals with >20 RCs are heterozy- 
gous or homozygous carriers of two intermediate RC alleles, 
as the repeat-primed PCR for Patient 1 (PI) indicated two 
possible alleles with intermediate repeat copies (Fig. 5B). All 
but two of the intermediate RC carriers of the original dataset 
were shown to be heterozygous for the intermediate allele and 
a low copy number allele (data not shown). The two other 
samples (PI; P9, Fig. 6) were further analysed with additional 
control samples (see below). For Southern blot analysis of the 
NINDS intermediate samples, we obtained additional DNA 
from cultured cells. However, the DNA quality was insuffi- 
cient to obtain a distinct band on Southern blot after multiple 
attempts. 

One sample (PI; lane 4, Fig. 6) was homozygous for 
intermediate repeats on Southern blot. Segregation analysis 
showed three different maximum intermediate RC alleles in 
her two offspring and herself (Fig. 5). Southern blot analysis 
confirmed the repeat-primed PCR results and excluded the 
possibility of the other parent (no sample available) also being 
a carrier of an intermediate RC allele, as both sibs carry low 
copy alleles not originating from PI. This suggests that the 




Figure 6 Southern blot analysis. 

(1) Control, 10 repeat copies (RCs). (2) Control, <4RCs. (3) 
Control <4RCs. (4) Patient 1, 30RCs. (5) Patient 3, 25RCs. 
(6) Unaffected sib (uS) from Faml, 26RCs. (7) Affected sib (aS) 
from Faml, 22RCs. (8) Overflow from lane (7). (9) Control, 
<4RCs. (10) Control, 5RCs. (11) Patient 9, 21RCs. 

allele with 26RC in the unaffected sib is derived from one 
of the alleles in patient PI, implying instability in the repeat. 
By her own history, PI developed ET at the age of 20. For 
this study, the patient underwent a neurological exam when 
she was 97 year old. She presented with mild bradykinesia 
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and rigidity, moderate postural and intention tremor in both 
upper extremities. 

Thirteen of the intermediate patients with >20 RCs had 
a phenotypic presentation that was compatible with classical 
PD (Tables 1 and 2). Eleven out of 14 patients have data avail- 
able on levodopa response; all are responsive to L-dopa. A 
family history for ALS was not ascertained in these interme- 
diate families. The intermediate RCs segregated genetically 
in a stable manner in several additional families. However, 
the intermediate alleles did not always segregate with PD 
(Figs S1-S8). 



Discussion 

Renton et al. suggested pathogenic alleles for the C90RF72 
repeat in ALS to be more than 30 RCs and benign alleles 
fewer than 20 RCs (Renton et al., 2011). Subsequent re- 
ports have identified rare controls in the 20 to 30RC range 
and more (Dejesus-Hernandez et al., 2011; Gijselinck et al., 
2012; Simon-Sanchez et al., 2012), with the most commonly 
reported maximum RC number in controls as 23. We demon- 
strate here that intermediate expansions of the C90RF72 
gene (>20RCs) are associated with increased risk for clini- 
cally diagnosed PD. Moreover, combining two different clin- 
ical datasets, we found seven PD but no controls having over 
23RCs. Six more reports have identified clinical PD patients 
with intermediate/ expanded RCs (Dejesus-Hernandez et al., 
2012; Majounie et al, 2012a; Xi et al, 2012; Daoud et al, 
2013; Harms et al., 2013; Lesage et al., 2013), with frequen- 
cies up to 2% amongst cases, similar to the work presented 
here. For five out of six reports, however, no risk assessment 
for the C90RF72 gene and PD relative to controls was per- 
formed. In the sixth study (Dejesus-Hernandez et al., 2012), 
a lack of association with increased PD risk was reported. 
However, in contrast to the other reports, no intermediate 
repeat copies above 23 were identified in the PD patients. 
For single allele analysis, they used cutoffs based on sample 
quintiles and percentiles, so no meaningful comparison can 
be evaluated with this report. However, it is interesting to note 
that the 95 th percentile (over 13 RCs) was the group with the 
highest increased odds ratio, though it was not significant. 

We confirmed the presence of the tagging allele (T) of 
rs3849942 in 100% of all intermediate carriers and the full 
previously reported haplotype in all carriers where genotype 
data were available. In addition, the tagging allele was in 95% 
of all individuals with eight repeats or greater, versus 10% in 
individuals with less than 8 RC, which is consistent with a 
recent report by van der Zee et al. (2013). These data indicate 
that genotyping this SNP could be used as a screening test 
to identify samples with possible higher repeat copy num- 
bers. The reasons for this correlation are unknown, but it 



could be due to a specific haplotype background that con- 
tributes to the expansion. Alternatively, the initial expansion 
may have occurred only once on this background, leading to 
further expansion. Our previously reported GWAS (Edwards 
et al., 2010) showed no evidence for association with PD for 
rs3849942 {p = 0.852), indicating that the effect we report 
here is specific to the repeat, not the underlying haplotype. 

Interestingly, several studies have suggested an excess of 
clinical PD or Parkinsonism cases in C90RF72 repeat ex- 
pansion families (Boeve et al., 2012; Cooper-Knock et al., 
2012; Mahoney et al, 2012; Simon-Sanchez et al, 2012). 
This finding could be explained if, indeed, the intermediate 
allele described here is a "premutation" allele for ALS. The 
clinical phenotype of the majority of the patients with greater 
than 20 RCs appears to be consistent with "typical" PD with 
no clinical evidence of motor neuron disease. Furthermore, 
we saw no evidence for ALS in our family histories. As ALS 
is much less frequent than PD, if such a relationship existed, 
it is more likely to be observed in the rarer ALS families than 
vice versa. The PD patients with >30 RCs did not have symp- 
toms observed (Udall patient) or reported (NINDS patient) 
indicative of ALS or FTD Two other reports of intermedi- 
ate/expanded repeats in clinical PD cases, however, indicate 
that some of these carriers present with a family history of 
atypical Parkinsonism or other neurodegenerative disorders 
(AD, ALS) (Xi et al, 2012; Lesage et al., 2013). 

The lack of segregation of the intermediate RC allele 
with clinical PD in the small number of families studied 
here (with asymptomatic carriers at an age that exceeds the 
AAO of their affected relative) and the occurrence of inter- 
mediate/ expanded repeats in some controls suggests that it is 
most likely a susceptibility risk factor, rather than causal for 
PD /Parkins onism . 

Although the research to elucidate the disease mechanism 
of this intermediate/expanded repeat is still in its infancy, the 
first reports suggested binding of the repeat to other RNAs, 
sequestering both from normal processing (Renton et al., 
2011), inclusion of the repeat in nuclear foci, also supporting 
RNA binding as a possible mechanism, and reduced protein 
expression and affected splicing (Dejesus-Hernandez et al., 
2011). Subsequent functional studies report on the presence 
of di-peptides — translated from the repeat in the mRNA 
through repeat associated non-ATG (RAN) translation — in 
the pathology of large, expanded repeat carriers (Ash et al., 
2013; Mori et al., 2013). This RAN translation is possible 
due to formation of stable secondary structures, hairpins or 
G-quadruplex structures, by the repeat in the mRNA (Fratta 
et al., 2012; Reddy et al., 2013). Additionally, analyses of 
the effect of the repeat in cell and Drosophila models indi- 
cated that repeat expression (30 RCs used in the report) is 
sufficient to cause neurodegeneration (Xu et al., 2013). The 
authors identified one RNA-binding protein (Pur a) binding 
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to the repeat in a concentration-dependent manner. Overex- 
pression of Pur a seemed to rescue the phenotype caused by 
the repeat, supporting the hypothesis of RNA sequestration as 
a possible disease mechanism. It has been shown that multiple 
other genes contributing to ALS risk (FUS, TDP43, ANG) 
are involved in RNA metabolism as well (Emara et al., 2010; 
Lagier-Tourenne et al., 2010). 

The finding of an abnormally expanded number of re- 
peats in PD, but fewer than that observed in ALS, is rem- 
iniscent of Fragile X-associated Tremor/ Ataxia Syndrome 
(FXTAS)/ Fragile X (Garcia-Arocena & Hagerman 2010) and 
ATXN2 (SCA2) (Kim et al, 2007). In these disorders, inter- 
mediate RCs (premutations) are associated with Parkinson- 
ism. The mechanism proposed for FXTAS is that the repeat, 
also G-C rich, binds local RNA, creating local cell stress and 
damage. Interestingly, ATXN2 has also been reported to be 
involved in RNA metabolism (Nonhoff et al., 2007), so this 
is another area of common overlap. 

Finally, initial evidence suggested that the repeat may be- 
come unstable with increasing numbers (Renton et al., 201 1). 
We did observe instability of the repeat in blood between pa- 
tient 1 and her offspring with a loss of repeat copies. Repeats 
of intermediate length are reported to be prone to both gain- 
ing and losing copies as opposed to longer repeats whose 
length mostly increases (Li et al., 2002). The instability of the 
intermediate repeats might allow them to reach a threshold 
where further elongation to pathogenic lengths is more likely. 

In summary, these data suggest that not only do the 
C90RF72 repeats contribute to significant risk of developing 
PD/Parkinsonism, but that abnormal RNA metabolism may 
also be an important pathogenesis factor in PD as well, and 
should be further investigated. 
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